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ABSTRACT 

We survey results of a quarter century of work on computation by 
reversible general-purpose computers (in this setting Turing ma- 
chines), and general reversible simulation of irreversible computa- 
tions, with respect to energy-, time- and space requirements. 

Categories and Subject Descriptors: F.2 [Algorithms], F.1.3 
[Performance] 

General Terms: Algorithms, Performance 

Keywords: Reversible computing, reversible simulation, adia- 
batic computing, low-energy computing, computational complex- 
ity, time complexity, space complexity, energy dissipation com- 
plexity, tradeoffs. 

1. INTRODUCTION 

Computer power has roughly doubled every 18 months for the 
last half-century (Moore's law). This increase in power is due pri- 
marily to the continuing miniaturization of the elements of which 
computers are made, resulting in more and more elementary gates 
per unit area with higher and higher clock frequency, accompa- 
nied by less and less energy dissipation per elementary computing 
event. Roughly, a linear increase in clock speed is accompanied 
by a square increase in elements per unit area — so if all elements 
compute all of the time, then the dissipated energy per time unit 
rises cubicly (linear times square) in absence of energy decrease 
per elementary event. The continuing dramatic decrease in dissi- 
pated energy per elementary event is what has made Moore's law 
possible. But there is a foreseeable end to this: There is a minimum 
quantum of energy dissipation associated with elementary events. 
This puts a fundamental limit on how far we can go with miniatur- 
ization, or does it? 

R. Landauer 1 1 1 1 has demonstrated that it is only the 'logically 
irreversible' operations in a physical computer that necessarily dis- 
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sipate energy by generating a corresponding amount of entropy for 
every bit of information that gets irreversibly erased; the logically 
reversible operations can in principle be performed dissipation-free. 
One should sharply distinguish between the issue of logical re- 
versibility and the issue of energy dissipation freeness. If a com- 
puter operates in a logically reversible manner, then it still may dis- 
sipate heat. For such a computer we know that the laws of physics 
do not preclude that one can invent a technology in which to im- 
plement a logically similar computer to operate physically in a dis- 
sipationless manner. Computers built from reversible circuits, or 
the reversible Turing machine, Q |2| [7), implemented with cur- 
rent technology will presumably dissipate energy but may conceiv- 
ably be implemented by future technology in an adiabatic fashion. 
But for logically irreversible computers adiabatic implementation 
is widely considered impossible. 

Thought experiments can exhibit a computer that is both logi- 
cally and physically perfectly reversible and hence perfectly dissi- 
pationless. An example is the billiard ball computer, |7j, and simi- 
larly the possibility of a coherent quantum computer, (61 1191 . Our 
purpose is to determine the theoretical ultimate limits to which the 
irreversible actions in an otherwise reversible computation can be 
reduced. 

Currently, computations are commonly irreversible, even though 
the physical devices that execute them are fundamentally reversible. 
At the basic level, however, matter is governed by classical me- 
chanics and quantum mechanics, which are reversible. This con- 
trast is only possible at the cost of efficiency loss by generating 
thermal entropy into the environment. With computational device 
technology rapidly approaching the elementary particle level it has 
been argued many times that this effect gains in significance to the 
extent that efficient operation (or operation at all) of future comput- 
ers requires them to be reversible, II ll lll 121171 1101^151 181. The mis- 
match of computing organization and reality will express itself in 
friction: computers will dissipate a lot of heat unless their mode of 
operation becomes reversible, possibly quantum mechanical. Since 
1940 the dissipated energy per bit operation in a computing device 
has — with remarkable regularity — decreased at the inverse rate of 
Moore's law |10| (making Moore's law possible). Extrapolation 
of current trends shows that the energy dissipation per binary logic 
operation needs to be reduced below kT (thermal noise) within 20 
years. Here k is Boltzmann's constant and T the absolute tem- 
perature in degrees Kelvin, so that kT ~ 3 x 10~ 21 Joule at room 
temperature. Even at kT level, a future device containing 1 trillion 
(10 12 ) gates operating at 1 terahertz (10 12 ) switching all gates all 
of the time dissipates about 3000 watts. Consequently, in contem- 
porary computer and chip architecture design the issue of power 
consumption has moved from a background worry to a major prob- 
lem. Theoretical advances in reversible computing are scarce and 



far between; many serious ones are listed in the references. For this 
review we have drawn primarily on the results and summarized ma- 
terial in I15II17II51 . It is a tell-tale sign of the difficulty of this area, 
that no further advances have been made in this important topic 
since that time. 

2. REVERSIBLE TURING MACHINES 

There is a decisive difference between reversible circuits and re- 
versible special purpose computers (7) on the one hand, and re- 
versible universal computers on the other hand 1 1 3 1. While one 
can design a special-purpose reversible version for every particular 
irreversible circuit using reversible universal gates, such a method 
does not yield an irreversible-to-reversible compiler that can exe- 
cute any irreversible program on a fixed universal reversible com- 
puter architecture as we are interested in here. 

In the standard model of a Turing machine the elementary oper- 
ations are rules in quadruple format (p,s,a,q) meaning that if the 
finite control is in state p and the machine scans tape symbol s, then 
the machine performs action a and subsequently the finite control 
enters state q. Such an action a consists of either printing a symbol 
s 1 in the tape square scanned, or moving the scanning head one tape 
square left or right. 

Quadruples are said to overlap in domain if they cause the ma- 
chine in the same state and scanning the same symbol to perform 
different actions. A deterministic Turing machine is defined as a 
Turing machine with quadruples no two of which overlap in do- 
main. 

Now consider the special format (deterministic) Turing machines 
using quadruples of two types: read/write quadruples and move 
quadruples. A read/write quadruple (p,a,b,q) causes the machine 
in state p scanning tape symbol a to write symbol b and enter 
state q. A move quadruple (p,*,a,q) causes the machine in state 
p to move its tape head by O 6 {— 1,+1} squares and enter state 
q, oblivious to the particular symbol in the currently scanned tape 
square. (Here '— 1' means 'one square left', and '+1' means 'one 
square right'.) Quadruples are said to overlap in range if they cause 
the machine to enter the same state and either both write the same 
symbol or (at least) one of them moves the head. Said differently, 
quadruples that enter the same state overlap in range unless they 
write different symbols. A reversible Turing machine is a deter- 
ministic Turing machine with quadruples no two of which overlap 
in range. A /r-tape reversible Turing machine uses (2k + 2) tuples 
which, for every tape separately, select a read/write or move on that 
tape. Moreover, any two tuples can be restricted to some single tape 
where they don't overlap in range. 

Simulation of irreversible Turing machines by reversible ones 
goes back to Lecerf|13|inl963, a paper that was little noticed and 
only recently rediscovered, and independently Bennett 1 1 ] in 1973, 
which paper commonly gets all the credit. To show that every par- 
tial recursive function can be computed by a reversible Turing ma- 
chine one can proceed as follows 1 1 1. Take the standard irreversible 
Turing machine computing that function. We modify it by adding 
an auxiliary storage tape called the 'history tape'. The quadruple 
rules are extended to 6-tuples to additionally manipulate the his- 
tory tape. To be able to reversibly undo (retrace) the computation 
deterministically, the new 6-tuple rules have the effect that the ma- 
chine keeps a record on the auxiliary history tape consisting of the 
sequence of quadruples executed on the original tape. Reversibly 
undoing a computation entails also erasing the record of its execu- 
tion from the history tape. This notion of reversible computation 
means that only 1 : 1 recursive functions can be computed. To re- 
versibly simulate an irreversible computation from x to f(x) one 
reversibly computes from input x to output (x,f(x)}. 



Reversible Turing machines or other reversible computers will 
require special reversible programs. One feature of such programs 
is that they should be executable when read from bottom to top 
as well as when read from top to bottom. Examples are the pro- 
grams in the later sections. In general, writing reversible programs 
will be difficult. However, given a general reversible simulation 
of irreversible computation, one can simply write an oldfashioned 
irreversible program in an irreversible programming language, and 
subsequently simulate it reversibly. This leads to the following: 

DEFINITION 1. An irreversible-to-reversible compiler receives 
an irreversible program as input and compiles it to a reversible 
program. 

3. ADIABATIC COMPUTATION 

All computations can be performed logically reversibly, [T\, at 
the cost of eventually filling up the memory with unwanted garbage 
information. This means that reversible computers with bounded 
memories require in the long run irreversible bit operations, for ex- 
ample, to erase records irreversibly to create free memory space. 
The minimal possible number of irreversibly erased bits to do so is 
believed to determine the ultimate limit of heat dissipation of the 
computation by Landauer's principle, II ll lT1[2). In reference |4] 
we and others developed a mathematical theory for the unavoid- 
able number of irreversible bit operations in an otherwise reversible 
computation. 

Methods to implement (almost) reversible and dissipationless 
computation using conventional technologies are often designated 
by the catch phrase 'adiabatic switching'. Many currently proposed 
physical schemes implementing adiabatic computation reduce irre- 
versibility by using longer switching times. This is done typically 
by switching over equal voltage gates after voltage has been equal- 
ized slowly. This type of switching does not dissipate energy, the 
only energy dissipation is incurred by pulling voltage up and down: 
the slower it goes the less energy is dissipated. If the computation 
goes infinitely slow, zero energy is dissipated. Clearly, this coun- 
teracts the purpose of low energy dissipation which is faster com- 
putation. 

In 1 15 1 it is demonstrated that even if adiabatic computation tech- 
nology advances to switching with no time loss, a similar phe- 
nomenon arises when we try to approach the ultimate limits of 
minimal irreversibility of an otherwise reversible computation, and 
hence minimal energy dissipation. This time the effect is due to the 
logical method of reducing the number of irreversible bit erasures 
in the computation irrespective of individual switching times. By 
computing longer and longer (in the sense of using more compu- 
tation steps), the amount of dissipated energy gets closer to ulti- 
mate limits. Moreover, one can trade-off time (number of steps) 
for energy: there is a new time-irreversibility (time-energy) trade- 
off hierarchy. The bounds we derive are also relevant for quantum 
computations which are reversible except for the irreversible obser- 
vation steps, I20II19I . 

3.1 Background 

Around 1940 a computing device dissipated about 10~ 2 loule 
per bit operation at room temperature. Since that time the dissi- 
pated energy per bit operation has roughly decreased by one order 
of magnitude (tenfold) every five years. Currently, a bit operation 
dissipates 1 about 10~ 17 Joule. 

Considerations of thermodynamics of computing started in the 
early fifties. J. von Neumann reputedly thought that a computer 

'After R.W. Keyes, IBM Research. 



operating at temperature T must dissipate at least Win 2 Joule per 
elementary bit operation (about 3 x 10~ 21 J at room temperature). 

Around 1960, R. Landauer 1 11 ] more thoroughly analyzed this 
question and concluded that it is only 'logically irreversible' oper- 
ations that dissipate energy. An operation is logically reversible if 
its inputs can always be deduced from the outputs. Erasure of in- 
formation in a way such that it cannot be retrieved is not reversible. 
Erasing each bit costs Win 2 energy, when computer operates at 
temperature T . 

3.2 Irreversibility Cost of Computation 

The ultimate limits of energy dissipation by computation will be 
expressed in number of irreversibly erased bits. Hence we consider 
compactification of records. In analogy of garbage collection by a 
garbage truck, the cost is less if we compact the garbage before we 
throw it away. The ultimate compactification which can be effec- 
tively exploited is expressed in terms of Kolmogorov complexity. 

Technically, the Kolmogorov complexity ofx given y is the length 
of the shortest binary program, for the reference universal prefix 
Turing machine, that on input y outputs x; it is denoted as C(x\y). 
For precise definitions, theory and applications, see 1 16 1. The Kol- 
mogorov complexity of x is the length of the shortest binary pro- 
gram with no input that outputs x; it is denoted as C(x) = C(x\X) 
where X denotes the empty input. Essentially, the Kolmogorov 
complexity of a file is the length of the ultimate compressed ver- 
sion of the file. 

Let R = R\ ,i?2, • • ■ be a standard enumeration of reversible Tur- 
ing machines, [ 1 ]. We define £(•,•) as m |4| (where it is denoted 
as £3 (•,•))• 

DEFINITION 2. The irreversibility cost Er(x, y) of computing y 
from x by a reversible Turing machine R is is 

E R (x,y) = mm{\p\ + \q\ : R({x,p)) = (y,q)}. 

We denote the class of all such cost functions by E. 

We call an element Eq of £ a universal irreversibility cost func- 
tion, if Q € R, and for all R in R 

E Q{x,y) <E R {x,y)+c R , 

for all x and y, where cr is a constant which depends on R but not 
on x or y. Standard arguments from the theory of Turing machines 
show the following. 

LEMMA 1. There is a universal irreversibility cost function in 
■£. Denote it by Eur. 

Proof. In Q a universal reversible Turing machine UR is con- 
structed which satisfies the optimality requirement. □ 

Two such universal (or optimal) machines UR and UR' will as- 
sign the same irreversibility cost to a computation apart from an 
additive constant term c which is independent of x and y (but does 
depend on UR and UR'). We select a reference universal function 
UR and define the irreversibility cost E(x,y) of computing y from 
x as 

E(x,y) = E UR (x,y). 

Because of the expression for E(x,y) in Theorem Q below it is 
called the sum cost measure in 1 4 1. 

In physical terms this cost is in units of £7Tn2, where k is Boltz- 
mann's constant, T is the absolute temperature in degrees Kelvin, 
and In is the natural logarithm. 

Because the computation is reversible, this definition is symmet- 
ric: we have E(x,y) = E(y,x). 



In our definitions we have pushed all bits to be irreversibly pro- 
vided to the start of the computation and all bits to be erased to the 
end of the computation. It is easy to see that this is no restriction. If 
we have a computation where irreversible acts happen throughout 
the computation, then we can always mark the bits to be erased, 
waiting with actual erasure until the end of the computation. Simi- 
larly, the bits to be provided can be provided (marked) at the start of 
the computation while the actual reading of them (simultaneously 
unmarking them) takes place throughout the computation). 

Now let us consider a general computation which outputs string 
y from input string x. We want to know the minimum irreversibility 
cost for such computation. This leads to the following theorem, for 
two different proofs see | 4"||15I , 

Theorem 1 (Fundamental Theorem). Up to an additive 
logarithmic term 

E(x,y)=C(x\y)+C(y\x). 

Erasing a record x is actually a computation from x to the empty 
string £. Hence its irreversibility cost is E(x,z). 

COROLLARY 1. Up to a logarithmic additive term, the irre- 
versibility cost of erasure is E(x,e) = C(x). 

3.3 Trading Time for Energy 

Because now the time bounds are important we consider the uni- 
versal Turing machine U to be the machine with two work tapes 
which can simulate t steps of a multitape Turing machine T in 
0(rlog?) steps (the Hennie-Stearns simulation). If some multitape 
Turing machine T computes x in time t from a program p, then U 
computes x in time 0(t\ogt) from p plus a description of T. 

DEFINITION 3. Let C'(x\y) be the minimal length of binary 
program (not necessarily reversibly) for the two work tape uni- 
versal Turing machine U computing x given y (for free) in time 
t. Formally, 

C(x\y) = min{|p| : U((p,y}) =xin < t(\x\) steps}. 

pen. 

C' (x\y) is called the f-time-limited conditional Kolmogorov com- 
plexity ofx given y. The unconditional version is defined as C 1 (x) := 
C'(x,e). A program p such that U(p) = x in < t(\x\) steps and 
\p\ = C (x) is denoted as x*. 

Note that with C' T (x\y) the conditional f-time-limited Kolmogorov 
complexity with respect to Turing machine T, for all x,y, C' (x\y) < 
C' T (x\y) + cj, where t' = O(flogf) and cj is a constant depending 
on T but not on x and y. 

This C' (•) is the standard definition of time-limited Kolmogorov 
complexity, 1 16 1. However, in the remainder of the paper we al- 
ways need to use reversible computations. Fortunately, in |3J it is 
shown that for any £ > 0, ordinary multitape Turing machines using 
T time and S space can be simulated by reversible ones using time 
0(T) and space 0(ST £ ). 

To do effective erasure of compacted information, we must at 
the start of the computation provide a time bound /. Typically, t is 
a recursive function and the complexity of its description is small, 
say 0(1). However, in Theorem[2]we allow for very large running 
times in order to obtain smaller C'(-) values. 

Theorem 2 (Effective Erasure). Ift(\x\) > \x\ is a time 
bound which is provided at the start of the computation, then eras- 
ing an n bit record x by an otherwise reversible computation can be 
done in time (number of steps) 0(2 ]>x]l t(\x\)) at irreversibility cost 



(hence also thermodynamic cost) C'(x) + 2C'(/|x) + 41ogC f (t\x) 
bits. (Typically we consider t as some standard explicit time bound 
and the last two terms adding up to 0(1).) 

PROOF. Initially we have in memory input x and a program p of 
length C'(t,x) to compute reversibly / from x. To separate binary x 
and binary p we need to encode a delimiter in at most 21ogC*(f|jc) 
bits. 

1. Use x and p to reversibly compute t. Copy t and reverse the 
computation. Now we have x, p and t. 

2. Use / to reversibly dovetail the running of all programs of 
length less than x to find the shortest one halting in time t 
with output x. This is x*. The computation has produced 
garbage bits g(x,x*). Copy xf, and reverse the computation 
to obtain x erasing all garbage bits g(x,x*). Now we have 
x,p,xf ,t in memory. 

3. Reversibly compute t from x by p, cancel one copy of t, and 
reverse the computation. Now we have x, p,x^~ in memory. 

4. Reversibly cancel x using x* by the standard method, and 
then erase x* and p irreversibly. 

□ 

Corollary 2. E(x,e) > lim ( ^coC'(x) =C(x), and by Theo- 
remU]up to an additional logarithmic term, E(x,z) = C(x). 

Essentially, by spending more time we can reduce the thermody- 
namic cost of erasure of x* to its absolute minimum. In the limit we 
spend the optimal value C(x) by erasing x*, since lim,^ooX* = x*. 
This suggests the existence of a trade-off hierarchy between time 
and energy. The longer one reversibly computes to perform final ir- 
reversible erasures, the less bits are erased and energy is dissipated. 
This intuitive assertion will be formally stated and rigourously proved 
below. 

DEFINITION 4. Let UR be the reversible version of the two 
worktape universal Turing machine, simulating the latter in linear 
time by Bennett's result mentioned above. Let E'(x,y) be the min- 
imum irreversibility cost of an otherwise reversible computation 
from x to y in time t . Formally, 

E'(x,y) = min {\p\ + \q\ : UR({x,p)) = (y,q) in < t(\x\) steps}. 

Because of the similarity with Corollary|2|(£(^, e) is about C(x)) 
one is erroneously led to believe that E' (x, e) = C' (x) up to a log ad- 
ditive term. However, the time-bounds introduce many differences. 
To reversibly compute xf we may require (because of the halting 
problem) at least 0(2Mf(|;c|)) steps after having decoded t, as in- 
deed is the case in the proof of Theorem|2| In contrast, E'(x,e) is 
about the number of bits erased in an otherwise reversible compu- 
tation which uses at most t steps. Therefore, as far as we know pos- 
sibly C{x) > E''(x,e) implies t' = a(2^t(\x\)). More concretely, 
it is easy to see that for each x and t(\x\) > \x\, 

E'(x,z) >C(x) >E''(x,e)/2, (1) 

with t'(\x\) = 0(t(\x\). Theorem[2]can be restated in terms of £'(•) 
as 

E''{x,B) < C'(x) + 2C'(t\x) + 41ogC f (/|x), 

with t'(\x\) = 0(2^t(\x\)). Comparing this to the righthand in- 
equality of EquationQwe have improved the upper bound on era- 
sure cost at the expense of increasing erasure time. However, these 



bounds only suggest but do not actually prove that we can exchange 
irreversibility for time. The following result definitely establishes 
the existence of a trade-off, 1151 . 

Theorem 3 (A Trade-Off Hierarchy). For each large 
enough n there is a string x of length n and a sequence ofm = j ^fn 
time functions t\(n) < t%(n) < ... < t m (n), such that 

E'< (x, e) > E' 2 (x, e) > . . . > E'- (x, e) . 

In the cost measures like E'(-,-) we have counted both the irre- 
versibly provided and the irreversibly erased bits. But Landauer's 
principle only charges energy dissipation costs for irreversibly erased 
bits. It is conceivable that the above results would not hold if one 
considers as the cost of erasure of a record only the irreversibly 
erased bits. However, we have show that Theorem[3]also holds for 
Landauer's dissipation measure, 1151 . in exactly the same form and 
by almost the same proof. 

4. OUTLINE SIMULATION RESULTS 

Currently, almost no algorithms and other programs are designed 
according to reversible principles (and in fact, most tasks like com- 
puting Boolean functions are inherently irreversible). To write re- 
versible programs by hand is unnatural and difficult. The natural 
way is to compile irreversible programs to reversible ones. This 
raises the question about efficiency of general reversible simulation 
of irreversible computation. Let us briefly summarize the research 
reviewed in the next sections. 

Suppose the irreversible computation to be simulated uses T time 
and 5 space. A first efficient method was proposed by Bennett 1 3 1, 
but it is space hungry and uses 2 time ST logi and space SlogT. If T 
is maximal, that is, exponential in 5, then the space use is S 2 . This 
method can be modeled by a reversible pebble game. Reference 
1 17 1 demonstrated that Bennett's method is optimal for reversible 
pebble games and that simulation space can be traded off against 
limited erasing. In 1 12 1 it was shown that using a method by Sipser 
1 21 1 one can reversibly simulate using only 0(5) extra space but at 
the cost of using exponential time. In 1 9 1 a relativized separation of 
reversible and irreversible space-time complexity classes is given. 

These previous results seem to suggest that a reversible simu- 
lation is stuck with either quadratic space use or exponential time 
use. This impression turns out to be false |5 1: 

There is a tradeoff between time and space which has the expo- 
nential time simulation and the quadratic space simulation as ex- 
tremes and for the first time gives a range of simulations using 
simultaneously subexponential (2/M is subexponential if /(«) = 
o(n)) time and subquadratic space. The idea is to use Bennett's 
pebbling game where the pebble steps are intervals of the simu- 
lated computation that are bridged by using the exponential simu- 
lation method. (It should be noted that embedding Bennett's peb- 
bling game in the exponential method gives no gain, and neither 
does any other iteration of embeddings of simulation methods.) 
Careful analysis shows that the simulation using k pebbles takes 
T' := S3 k 2°( T l 2 ' time and 5' = 0(kS) space, and in some cases 
the upper bounds are tight. For £ = we have the exponential time 
simulation method and for k = logT we have Bennett's method. 
Interesting values arise for say 

(a) k = loglogT which yields T 1 = S , (logr) lo s 3 2°( 7 V lo £ 7 ') and 
5'=51oglo g r<51og5; 

2 By judicious choosing of simulation parameters this method can 
be tweaked to run in Sr I+£ time for every e > at the cost of intro- 
ducing a multiplicative constant depending on 1 /e. The complexity 
analysis of (3J was completed in 1141 . 



(b) for k = yflogT we have S' = S^logT < SVS and T' = 

2 0(T/2-y^ T ) 

(c) Let T,S, T',S' be as above. Eliminating the unknown k shows 
the tradeoff between simulation time T 1 and extra simulation space 

5': T'=53T2°( 7 '/ 2 ^). 

(d) Let T,S,T',S' be as above and let the irreversible compu- 
tation be halting and compute a function from inputs of n bits to 
outputs. For general reversible simulation by a reversible Turing 
machine using a binary tape alphabet and a single tape, 5' > n + 
log T + 0(1) and T' >T. This lower bound is optimal in the sense 
that it can be achieved by simulations at the cost of using time ex- 
ponential in S. 

Main open problem: The ultimate question is whether one can 
do better, and obtain improved upper and lower bounds on the 
tradeoff between time and space of reversible simulation, and in 
particular whether one can have almost linear time and almost lin- 
ear space simultaneously. 

5. TIME PARSIMONIOUS SIMULATION 

5.1 Reversible Programming 

Reversible Turing machines or other reversible computers will 
require special reversible programs. One feature of such programs 
is that they should be executable when read from bottom to top as 
well as when read from top to bottom. Examples are the programs 
we show in the later sections. In general, writing reversible pro- 
grams will be difficult. However, given a general reversible sim- 
ulation of irreversible computation, one can simply write an old- 
fashioned irreversible program in an irreversible programming lan- 
guage, and subsequently simulate it reversibly. This leads to the 
following: 

DEFINITION 5. An irreversible-to-reversible compiler receives 
an irreversible program as input and reversibly compiles it to a 
reversible program. Subsequently, the reversible program can be 
executed reversibly. 

Note that there is a decisive difference between reversible circuits 
and reversible special purpose computers on the one hand, and re- 
versible universal computers on the other hand. While one can de- 
sign a special-purpose reversible version for every particular irre- 
versible circuit using reversible universal gates, such a method does 
not yield an irreversible-to-reversible compiler that can execute any 
irreversible program on a fixed universal reversible computer archi- 
tecture as we are interested in here. 

5.2 Initial Reversible Simulations 

The reversible simulation in [ 1 ] of T steps of an irreversible com- 
putation from x to f(x) reversibly computes from input x to output 
(x,f(x)) in T' = 0(T) time. However, since this reversible simu- 
lation at some time instant has to record the entire history of the 
irreversible computation, its space use increases linearly with the 
number of simulated steps T. That is, if the simulated irreversible 
computation uses S space, then for some constant c > 1 the simu- 
lation uses T 1 ~ c + cT time and 5' ~ c + c(S +T) space. This can 
be an unacceptable amount of space for many practically useful 
computations. 

In 1 3 1 another elegant simulation technique is devised reducing 
the auxiliary storage space. This simulation does not save the entire 
history of the irreversible computation but it breaks up the sim- 
ulated computation into segments of about S steps and saves in 
a hierarchical manner checkpoints consisting of complete instan- 
taneous descriptions of the simulated machine (entire tape con- 



tents, tape heads positions, state of the finite control). After a 
later checkpoint is reached and saved, the simulating machine re- 
versibly undoes its intermediate computation, reversibly erasing the 
intermediate history and reversibly canceling the previously saved 
checkpoint. Subsequently, the computation is resumed from the 
new checkpoint onwards. It turns out that this can be done us- 
ing limited time r lo s 3 and space SlogT. Fine-tuning the method 
goes as follows: The reversible computation simulates k" segments 
of length m of irreversible computation in (2k— 1)" segments of 
length ®(m + S) of reversible computation using n(k— 1) + 1 check- 
point registers using &(m + S) space each, for every k,n,m. 

This way it is established that there are various tradeoffs possi- 
ble in time-space in between T 1 = &(T) and S' = ®(TS) at one ex- 
treme (k = l,m = T,n = 1) and (with the corrections of 1 14 1) T 1 = 
0(r 1+ 75 E ) and 5" = 0(c(e)5(l +logT/5)) with c(e) = £2 1 / e for 
every e > 0, using always the same simulation method but with dif- 
ferent parameters k, n where e = log^,(2A: — 1) and m = &(S). Typ- 
ically, for k = 2 we have e = log 3. Since for T > 2 s the machine 
goes into a computational loop, we always have 5 < logT. There- 
fore, every irreversible Turing machine using space S can be sim- 
ulated by a reversible machine using space S 2 in polynomial time. 
Let us note that it is possible to improve the situation by reversibly 
simulating only the irreversible steps. Call a quadruple of a Turing 
machine irreversible if its range overlaps with the range of another 
quadruple. A step of the computation is irreversible if it uses an 
irreversible quadruple. Let the number of irreversible steps in a T 
step computation be denoted by /. Clearly, I <T. The simulation 
results hold with T in the auxiliary space use replaced by /. In 
particular, 5' = 0(51og/). In many computations, / may be much 
smaller than T. There arises the problem of estimating the number 
of irreversible steps in a computation. (More complicatedly, one 
could extend the notion of irreversible step to those steps which 
can be reversed on local information alone. In some cases this is 
possible even when the used quadruple itself was irreversible.) 

We at some point conjectured that all reversible simulations of an 
irreversible computation can essentially be represented as the peb- 
ble game defined below, and that consequently the lower bound of 
Corollary|4]applies to all reversible simulations of irreversible com- 
putations. This conjecture was refuted in 1 12 1 using a technique due 
to 1211 to show that there exists a general reversible simulation of 
an irreversible computation using only order S space at the cost of 
using a thoroughly unrealistic simulation time exponential in S. 

In retrospect the conjecture was phrased too general: it should be 
restricted to useful simulations — using linear or slightly superlinear 
time and space simultaneously. The real question is whether there 
is a compiler that takes as input any irreversible algorithm A using 
S space and T time and produces a reversible algorithm B such that 
B(x) = A(x) for all input x and using T 1 = 0(T) time and 5' = O(S) 
space. In the extreme cases of time and space use this is possible: If 
S=&(T) then the simulation in [T | does the trick, and if T = &(2 S ) 
then the simulation of 1 12 works. For all other cases the pebble 
game analysis below has been used in [9] to show that any such 
simulation, if it exists, cannot relativize to oracles, or work in cases 
where the space bound is much less than the input length. (This 
is a standard method of giving evidence that the aimed-for result — 
here: simulation doesn't exist — is likely to be true in case the result 
itself is too hard to obtain.) 

5.3 Reversible Pebbling 

Let G be a linear list of nodes {1,2, .. . ,Tq}. We define a pebble 
game on G as follows. The game proceeds in a discrete sequence 
of steps of a single player. There are n pebbles which can be put on 
nodes of G. At any time the set of pebbles is divided in pebbles on 



nodes of G and the remaining pebbles which are called/ree pebbles. 
At every step either an existing free pebble can be put on a node of 
G (and is thus removed from the free pebble pool) or be removed 
from a node of G (and is added to the free pebble pool). Initially 
G is unpebbled and there is a pool of free pebbles. The game is 
played according to the following rule: 

Reversible Pebble Rule: If node i is occupied by a pebble, then 
one may either place a free pebble on node i + 1 (if it was not 
occupied before), or remove the pebble from node i + 1. 

We assume an extra initial node permanently occupied by an 
extra, fixed pebble, so that node 1 may be (un)pebbled at will. This 
pebble game is inspired by the method of simulating irreversible 
Turing Machines on reversible ones in a space efficient manner. 
The placement of a pebble corresponds to checkpointing the next 
state of the irreversible computation, while the removal of a pebble 
corresponds to reversibly erasing a checkpoint. Our main interest 
is in determining the number of pebbles k needed to pebble a given 
node i. 

The maximum number n of pebbles which are simultaneously on 
G at any one time in the game gives the space complexity nS of the 
simulation. If one deletes a pebble not following the above rules, 
then this means a block of bits of size S is erased irreversibly. This 
pebble game is inspired by the method of simulating irreversible 
Turing Machines on reversible ones in a space efficient manner. 
The placement of a pebble corresponds to checkpointing the current 
state of the irreversible computation, while the removal of a pebble 
corresponds to reversibly erasing a checkpoint. Our main interest 
is in determining the number of pebbles k needed to pebble a given 
node (. 

The maximum number n of pebbles which are simultaneously 
on G at any one time in the game gives the space complexity nS 
of the simulation. If one deletes a pebble not following the above 
rules, then this means a block of bits of size 5 is erased irreversibly. 
The limitation to Bennett's simulation is in fact space, rather than 
time. When space is limited, we may not have enough place to 
store garbage, and these garbage bits will have to be irreversibly 
erased. We establish a tight lower bound for any strategy for the 
pebble game in order to obtain a space-irreversibility tradeoff. 

5.4 Algorithm 

We describe the idea of Bennett's simulation |3|. This simula- 
tion is optimal 1171 among all reversible pebble games, and we will 
show the proof below. The total computation of T steps is broken 
into 2 k segments of length m = T2~ k . Every mth point of the com- 
putation is a node in the pebbling game; node i corresponding to im 
steps of computation. 

For each pebble a section of tape is reserved long enough to store 
the whole configuration of the simulated machine. By enlarging the 
tape alphabet, each pebble will require space only 5 + 0(1). 

Both the pebbling and unpebbling of a pebble t on some node, 
given that the previous node has a pebble s on it, will be achieved by 
a single reversible procedure bridge(.s,/). This looks up the config- 
uration at section s, simulates m steps of computation in a manner 
described in section|6| and exclusive-or's the result into section /. 
If t was a free pebble, meaning that its tape section is all zeroes, the 
result is that pebble t occupies the next node. If / already pebbled 
that node then it will be zeroed as a result. 

The essence of Bennett's simulation is a recursive subdivision of 
a computation path into 2 halves, which are traversed in 3 stages; 
the first stage gets the midpoint pebbled, the second gets the end- 
point pebbled, and the 3rd recovers the midpoint pebble. The fol- 
lowing recursive procedure implements this scheme; Pebble(i,f ,n) 



uses free pebbles 0, ...,«— 1 to compute the 2"th node after the 
one pebbled by s, and exclusive-or's that node with pebble / (either 
putting t on the node or taking it off). Its correctness follows by 
straightforward induction. Note that it is its own reverse; execut- 
ing it twice will produce no net change. The pebble parameters s 
and / are simply numbers in the range — 1 , 0, 1 , . . . , k. Pebble - 1 is 
permanently on node 0, pebble k gets to pebble the final node, and 
pebble i, for < i < k pebbles nodes that are odd multiples of 2' . 
The entire simulation is carried out with a call pebble(— 1 , k, k). 

pebble(i,f ,ri) 
{ 



if (w = 0) 




bridge^, t); 




fi (n = 0) 




if(«>0) 




let r = n — 1 




pebble(.s, r,n — 


i); 


pebb\e(r,t,n — 


i); 


pebble(.s, r,n — 


i) 


& (n > 0) 





} 

As noted by Bennett, both branches and merges must be labeled 
with mutually exclusive conditions to ensure reversibility. Recur- 
sion can be easily implemented reversibly by introducing an extra 
stack tape, which will hold at most n stack frames of size O(log)i) 
each, for a total of O(nlogn). 

5.5 Optimality 

This pebbling method is optimal in that no more than 2" +1 - 1 
steps can be bridged with n pebbles 1 17). It is easy to see that the 
method achieves this; the difficult part is showing that no reversible 
pebbling method can do better. It turns out that characterizing the 
maximum node that can be pebbled with a given number of peb- 
bles is best done by completely characterizing what pebble con- 
figurations are realizable. First we need to introduce some helpful 
notions. 

In a given pebble configuration with / free pebbles, a placed peb- 
ble is called available if there is another pebble at most 2' positions 
to its left (0 being the leftmost node). According to the above pro- 
cedures, an available pebble can be removed with the use of the 
free pebbles. For convenience we imagine this as a single big step 
in our game. 

Call a pebble configuration weakly solvable if there is a way of 
repeatedly removing an available pebble until all are free. Note that 
such configurations are necessarily realizable, since the removal 
process can be run in reverse to recreate the original configuration. 
Call a pebble configuration strongly solvable if all ways of repeat- 
edly removing an available pebble lead to all being free. Obviously 
any strongly solvable configuration is also weakly solvable. 

The starting configuration is obviously both weakly and strongly 
solvable. How does the single rule of the game affect solvabil- 
ity? Clearly, adding a pebble to a weakly solvable configuation 
yields another weakly solvable configuation, while removing a peb- 
ble from a strongly solvable configuation yields another strongly 
solvable configuation. It is not clear if removing a pebble from a 
weakly solvable configuation yields another one. If such is the case 
then we may conclude that all realizable configurations are weakly 
solvable and hence the two classes coincide. This is exactly what 
the next theorem shows. 

THEOREM 4. Every weakly solvable configuration is strongly 
solvable. 



PROOF. Let / be the number of free pebbles in a weakly solv- 
able configuration. Number the placed pebbles f,f + l,...,n— 1 
according to their order of removal. It is given that, for all i, peb- 
ble i has a higher-numbered pebble at most 2' positions to its left 
(number the fixed pebble at infinity). We know that pebble / is 
available. Suppose a pebble g with g > / is also available — so there 
must be a pebble at most 2^ positions to its left. It suffices to show 
that if pebble g is removed first, then pebbles /,/ + 1 , . . . , g — 1 
are still available when their turn comes. Suppose pebble j finds 
pebble g at most 2 J places to its left (otherwise j will still be avail- 
able after g's removal for sure). Then after removal of pebbles 
g, /, / + 1 , . . . , j — 1 , it will still find a higher-numbered pebble at 

most V + 2 f + 2 f + 2 f+ 1 H h V~ 1 < 2-> +1 places to its left, thus 

making it available given the extra now free pebble g. □ 

COROLLARY 3. A configuration with f free pebbles is real- 
izable if and only if its placed pebbles can be numbered f,f + 
l,...,n— 1 such that pebble i has a higher-numbered pebble at 
most 2' positions to its left. 

COROLLARY 4. The maximum reachable node with n pebbles 
is £™~o 2 l = 2" - 1. 

Moreover, if pebble(s,n) takes t(n) steps we find f(0) = 1 and 
t (n) = 3t(n - 1) + 1 = (3" +I - l)/2. That is, the number of steps 
T' c of a winning play of a pebble game of size Tq = 2" — 1 is Tq ~ 

1.53", that is, T'qK.T 1 ^. 

5.6 Trading Space for Erasures 

The simulation above follows the rules of the pebble game of 
length Tq = 2" — 1 with n pebbles above. A winning strategy for a 
game of length Tq using n pebbles corresponds with reversibly sim- 
ulating Tq segments of 5 steps of an irreversible computation using 
5 space such that the reversible simulator uses T 1 fa ST C fa ST^ 
steps and total space 5' = nS. The space 5' corresponds to the max- 
imal number of pebbles on G at any time during the game. The 
placement or removal of a pebble in the game corresponds to the 
reversible copying or reversible cancelation of a 'checkpoint' con- 
sisting of the entire instantaneous description of size 5 (work tape 
contents, location of heads, state of finite control) of the simulated 
irreversible machine. The total time TqS used by the irreversible 
computation is broken up in segments of size 5 so that the reversible 
copying and canceling of a checkpoints takes about the same num- 
ber of steps as the computation segments in between checkpoints. 
3 

We can now formulate a tradeoff between space used by a poly- 
nomial time reversible computation and irreversible erasures as pro- 
posed in 1 17 1 . First we show that allowing a limited amount of era- 
sure in an otherwise reversible computation means that we can get 
by with less work space. Therefore, we define an m-erasure pebble 
game as the pebble game above but with the additional rule 

• In at most m steps the player can remove a pebble from any 
node i > 1 without node i — 1 being pebbled at the time. 

An m-erasure pebble game corresponds with an otherwise re- 
versible computation using mS irreversible bit erasures, where 5 is 
the space used by the irreversible computation being simulated. 

3 If we are to account for the permanent pebble on node 0, we get 
that the simulation uses n + 1 pebbles for a pebble game with n 
pebbles of length Tq + 1. The simulation uses n+ 1 = 5' /S peb- 
bles for a simulated number of S(Tq + 1) steps of the irreversible 
computation. 



LEMMA 2. There is a winning strategy with n + 2 pebbles and 
m—\ erasures for pebble games G with Tq = m2 n , for all m> 1. 

PROOF. The strategy is to use 2 pebbles as springboards that are 
alternately placed 2" in front of each other using the remaining n 
pebbles to bridge the distance. The most backward springboard can 
be erased from its old position once all n pebbles are cleared from 
the space between it and the front springboard. 

The simulation time T' G is T' c fa 2m- 3" _I +2 fa 2m(r G /m) lo s 3 = 

2^1-1083^3 for j c = m2 n-l q 

Theorem 5 (Space-Irreversibility). (i) Pebble games G 
of size 2" — 1 can be won using n pebbles but not using n—\ peb- 
bles. 

( ii) IfG is a pebble game with a winning strategy using n pebbles 
without erasures, then there is also a winning strategy for G using 
E erasures and n — log(£ + 1) pebbles (for E is an odd integer at 
least 1 ). 

PROOF, (i) By Corollory|4] 

(ii) By (i), Tq = 2" — 1 is the maximum length of a pebble game 
G for which there is a winning strategy using n pebbles and no 
erasures. By Lemma [2] we can pebble a game G of length Tq = 
m2" _logm = 2" using n+ 1 — logm pebbles and 2m— 1 erasures. □ 

We analyze the consequences of Theorem [5] It is convenient 
to consider the special sequence of values E := 2 k+2 — 1 for k := 

0, 1, Let G be Bennett's pebble game of Lemma |4] of length 

Tq = 2"— 1. It can be won using n pebbles without erasures, or 
using n — k pebbles plus 2 k+2 — 1 erasures (which gives a gain over 
not erasing as in Lemma [4] only for k > 1), but not using n — 1 
pebbles. 

Therefore, we can exchange space use for irreversible erasures. 
Such a tradeoff can be used to reduce the space requirements of 
the reversible simulation. The correspondence between the erasure 
pebble game and the otherwise reversible computations using irre- 
versible erasures is that if the pebble game uses n — k pebbles and 
2 k+2 — 1 erasures, then the otherwise reversible computation uses 
(;j — k)S space and erases (2 k+2 — 1)5 bits irreversibly. 

Therefore, a reversible simulation according to the pebble game 
of every irreversible computation of length T = (2" — 1)5 can be 
done using nS space using (r/5) log3 5 time, but is impossible us- 
ing in — 1)5 space. It can also be performed using (n — k)S space, 
(2*+ 2 - 1)5 irreversible bit erasures and 2(*+ 1 H 1 - log3 Hi (r/5) log3 5 
time. In the extreme case we use no space to store the history and 
erase about AT bits. This corresponds to the fact that an irreversible 
computation may overwrite its scanned symbol irreversibly at each 
step. 

DEFINITION 6. Consider a simulation according to the pebble 
game using 5' storage space and T time which reversibly computes 
y = (x,f(x)) from x in order to simulate an irreversible computa- 
tion using 5 storage space and T time which computes f(x) from 
x. The irreversible simulation cost B s (x.y) of the simulation is the 
number of irreversibly erased bits in the simulation (with the pa- 
rameters 5, T, T understood). 

If the irreversible simulated computation from x to f(x) uses T 
steps, then for 5' = nS and n = log(T/5) we have above treated 
the most space parsimonious simulation which yields B s (x,y) = 0, 
withy = (x,f(x)}. 

COROLLARY 5. Simulating a T = (2" — 1)5 step irreversible 
computation from x to f(x) using 5 space by a computation from x 
to y = (x,f(x)), the irreversible simulation cost satisfies: 



(i) B("- k ) s (x,y) < B nS (x,y) + [2 k+1 -\)S,forn>k> 1. 

(ii) B^"- l '> s (x,y) > B" s (x,y),forn> 1. 

For the most space parsimonious simulation with n = log(T/5) 
this means that 

B s(io g (T/s)-k) {x ^ y) < B «og(r/s) (jC)j) + {2 k+i _ l)S 

6. SPACE PARSIMONIOUS SIMULATION 

Lange, McKenzie and Tapp, 1121 . devised a reversible simula- 
tion, LMT- simulation for short, that doesn't use extra space, at the 
cost of using exponential time. Their main idea of reversibly sim- 
ulating a machine without using more space is by reversibly cy- 
cling through the configuration tree of the machine (more precisely 
the connected component containing the input configuration). This 
configuration tree is a tree whose nodes are the machine configura- 
tions and where two nodes are connected by an edge if the machine 
moves in one step from one configuration to the other. We con- 
sider each edge to consist of two half-edges, each adjacent to one 
configuration. 

The configuration tree can be traversed by alternating two per- 
mutations on half-edges: a swapping permutation which swaps the 
two half-edges constituting each edge, and a rotation permutation 
whose orbits are all the half-edges adjacent to one configuration. 
Both permutations can be implemented in a constant number of 
steps. For simplicity one assumes the simulated machine strictly 
alternates moving and writing transitions. To prevent the simula- 
tion from exceeding the available space 5, each pebble section is 
marked with special left and right markers f,$, which we assume 
the simulated machine not to cross. Since this only prevents cross- 
ings in the forward simulation, we furthermore, with the head on 
the left (right) marker, only consider previous moving transitions 
from the right (left). 

7. TIME-SPACE TRADEOFF 

A call pebble(.s,?,«) results in 3" calls to bridge(-,-)- Bennett 
chose the number of pebbles large enough (n = Q.(logT)) so that 
m becomes small, on the order of the space 5 used by the simu- 
lated machine. In that case bridge(j,f) is easily implemented with 
the help of an additional history tape of size m which records the 
sequence of transitions. Instead, 1 5 1 showed how to allow an arbi- 
trary choice of n and resort to the space efficient simulation of 1 12 1 
to bridge the pebbled checkpoints. A similar solution was arrived 
at later and independently in the unpublished manuscript 1 22 ]. 

To adapt the LMT simulation to our needs, we equip our simu- 
lating machine with one extra tape to hold the simulated configu- 
ration and another extra tape counting the difference between for- 
ward and backward steps simulated, m = 2" steps of computation 
can be bridged with a logm bits binary counter, incremented with 
each simulated forward step, and decremented with each simulated 
backward step. Having obtained the configuration m steps beyond 
that of pebble s, it is exclusive-or'd into section t and then the LMT 
simulation is reversed to end up with a zero counter and a copy of 
section s, which is blanked by an exclusive-or from the original. 

bridge(s, t) 
{ 

copy section s onto (blanked) simulation tape 
setup: goto enter; 
loopl: come from endloopl; 
simulate step with swap&rotate and adjust counter 
if (counter=0) 
rotate back; 



if (simulation tape = section s) 

enter: come from start; 
fi (simulation tape = section s) 
fi (counter=0) 

endloopl: if (counter !=m) goto loopl; 
exclusive-or simulation tape into section t 
if (counter !=m) 

loop2: come from endloop2; 
reverse-simulate step with anti-rotate&swap and adjust counter 
if (counter=0) 

rotate back; 

if (simulation tape = section s) goto exit; 
fi (counter=0) 
endloop2: goto loop2; 
exit: clear simulation tape using section s 

} 

7.1 Complexity Analysis 

Let us analyze the time and space used by this simulation. 

THEOREM 6. An irreversible computation using time T and 
space S can be simulated reversibly in time T' = 3 k 2°^ T ^ 2k ^S and 
space S' = S(l + O(k)), where k is a parameter that can be cho- 
sen freely < k < log T to obtain the required tradeoff between 
reversible time T and space S' . 

PROOF. (Sketch) Every invocation of the bridge() procedure takes 
time 0(2°Ws). That is, every configuration has at most 0(1) 
predecessor configurations where it can have come from (constant 
number of states, constant alphabet size and choice of direction). 
Hence there are < 2 '" 1 ' configurations to be searched and about 
as many potential start configurations leading in m moves to the 
goal configuration, and every tape section comparison takes time 
0(S). The pebbling game over 2 k nodes takes 3* (un)pebbling steps 
each of which is an invocation of bridge(). Filling in m = T/2 k 
gives the claimed time bound. Each of the k + 0(1) pebbles takes 
space 0(5), as does the simulation tape and the counter, giving the 
claimed total space. □ 

It is easy to verify that for some simulations the upper bound is 
tight. The boundary cases, k = gives the LMT-simulation using 
exponential time and no extra space, and k = log T gives Bennett's 
simulation using at most square space and subquadratic time. Tak- 
ing intermediate values of k we can choose to reduce time at the 
cost of an increase of space use and vice versa. In particular, spe- 
cial values k = log log T and k = \ff give the results using simul- 
taneously subexponential time and subquadratic space exhibited in 
the introduction. Eliminating k we obtain: 

COROLLARY 6. Let T,S,T' ,S' be as above. Then there is a 
reversible simulation that has the following tradeoff between simu- 
lation time T and extra simulation space S : 

T' = S3T2°( r / 2T ). 

7.2 Local Irreversible Actions 

Suppose we have an otherwise reversible computation contain- 
ing local irreversible actions. Then we need to reversibly simulate 
only the subsequence of irreversible steps, leaving the connecting 
reversible computation segments unchanged. That is, an irreversib- 
lity parsimonious computation is much cheaper to reversibly simu- 
late than an irreversibility hungry one. 



8. UNKNOWN COMPUTING TIME 

In the previous analysis we have tacitly assumed that the re- 
versible simulator knows in advance the number of steps T taken 
by the irreversible computation to be simulated. In this context one 
can distinguish on-line computations and off-line computations to 
be simulated. On-line computations are computations which inter- 
act with the outside environment and in principle keep running for- 
ever. An example is the operating system of a computer. Off-line 
computations are computations which compute a definite function 
from an input (argument) to an output (value). For example, given 
as input a positive integer number, compute as output all its prime 
factors. For every input such an algorithm will have a definite run- 
ning time. 

There is a well-known simple device to remove this dependency 
for batch computations without increasing the simulation time (and 
space) too much 1 17 1. Suppose we want to simulate a computa- 
tion with unknown computation time T . Then we simulate t steps 
of the computation with t running through the sequence of values 
2,2 2 ,2 3 , . . . For every value t takes on we reversibly simulate the 
first t steps of the irreversible computation. If T > t then the com- 
putation is not finished at the end of this simulation. Subsequently 
we reversibly undo the computation until the initial state is reached 
again, set t := 2t and reversibly simulate again. This way we con- 
tinue until t > T at which bound the computation finishes. The total 
time spent in this simulation is 

riogr] iS , 
T" <2 £ S3^"2 0(2 ' T) < 2T'. 

9. LOWER BOUND 

It is not difficult to show a simple lower bound 1 5 1 on the extra 
storage space required for general reversible simulation. We con- 
sider only irreversible computations that are halting computations 
performing a mapping from an input to an output. For convenience 
we assume that the Turing machine has a single binary work tape 
delemited by markers t, t that are placed 5 positions apart. Initially 
the binary input of length n is written left adjusted on the work tape. 
At the end of the computation the output is written left adjusted on 
the work tape. The markers are never moved. Such a machine 
clearly can perform every computation as long as S is large enough 
with respect to n. Assume that the reversible simulator is a similar 
model albeit reversible. The average number of steps in the com- 
putation is the uniform average over all equally likely inputs of n 
bits. 

THEOREM 7. To generally simulate an irreversible halting com- 
putation of a Turing machine as above using storage space S and 
T steps on average, on inputs of length n, by a general reversible 
computation using S' storage space and T steps on average, the 
reversible simulator Turing machine having q' states, requires triv- 
ially T' >T and S 1 >n + log T — 0(1) up to a logarithmic additive 
term. 

PROOF. There are 2" possible inputs to the irreversible compu- 
tation, the computation on every input using on average T steps. 
A general simulation of this machine cannot use the semantics 
of the function being simulated but must simulate every step of 
the simulated machine. Hence T' > T. The simulator being re- 
versible requires different configurations for every step of every- 
one of the simulated computations that is, at least 2"T configura- 
tions. The simulating machine has not more than q'2 s S' distinct 
configurations — 2 s distinct values on the work tape, q' states, and 
S' head positions for the combination of input tape and work tape. 



Therefore, q'2 s ' S' > 2"T. That is, q'S'2 s '-" > T which shows that 
5"-n-log5'>logr-logg'. □ 

For example, consider irreversible computations that don't use 
extra space apart from the space to hold the input, that is, 5 = n. 
An example is the computation of f(x) = 0. 

• If T is polynomial in n then 5' = n + fl(logn). 

• If T is exponential in n then S' = n + il(n) . 

Thus, in some cases the LMT-algorithm is required to use extra 
space if we deal with halting computations computing a function 
from input to output. In the final version of the paper 1121 the 
authors have added that their simulation uses some extra space for 
counting (essentially 0(5)) in case we require halting computations 
from input to output, matching the lower bound above for S = n 
since their simulation uses on average T 1 steps exponential in S. 

Optimality and Tradeoffs: The lower bound of Theorem Q is 
optimal in the following sense. As one extreme, the LMT-algorithm 
of 1 12 1 discussed above uses S =n + log T space for simulating ir- 
reversible computations of total functions on inputs of n bits, but at 
the cost of using T 1 = £l(2 s ) simulation time. As the other extreme, 
Bennett's simple algorithm in 1 1 1 uses T' = 0(T) reversible sim- 
ulation time, but at the cost of using 5' = Sl(T) additional storage 
space. This implies that improvements in determining the complex- 
ity of reversible simulation must consider time-space tradeoffs. 
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